Goto

Collaborating Authors

 above example


Learning from Mixtures of Private and Public Populations

Neural Information Processing Systems

We initiate the study of a new model of supervised learning under privacy constraints. Imagine a medical study where a dataset is sampled from a population of both healthy and unhealthy individuals. Suppose healthy individuals have no privacy concerns (in such case, we call their data ``public'') while the unhealthy individuals desire stringent privacy protection for their data. In this example, the population (data distribution) is a mixture of private (unhealthy) and public (healthy) sub-populations that could be very different. Inspired by the above example, we consider a model in which the population $\cD$ is a mixture of two possibly distinct sub-populations: a private sub-population $\Dprv$ of private and sensitive data, and a public sub-population $\Dpub$ of data with no privacy concerns.


Learning from Mixtures of Private and Public Populations

Neural Information Processing Systems

We initiate the study of a new model of supervised learning under privacy constraints. Imagine a medical study where a dataset is sampled from a population of both healthy and unhealthy individuals. Suppose healthy individuals have no privacy concerns (in such case, we call their data public'') while the unhealthy individuals desire stringent privacy protection for their data. In this example, the population (data distribution) is a mixture of private (unhealthy) and public (healthy) sub-populations that could be very different. Inspired by the above example, we consider a model in which the population \cD is a mixture of two possibly distinct sub-populations: a private sub-population \Dprv of private and sensitive data, and a public sub-population \Dpub of data with no privacy concerns.


Indo LEGO-ABSA: A Multitask Generative Aspect Based Sentiment Analysis for Indonesian Language

Suchrady, Randy Zakya, Purwarianti, Ayu

arXiv.org Artificial Intelligence

Aspect-based sentiment analysis is a method in natural language processing aimed at identifying and understanding sentiments related to specific aspects of an entity. Aspects are words or phrases that represent an aspect or attribute of a particular entity. Previous research has utilized generative pre-trained language models to perform aspect-based sentiment analysis. LEGO-ABSA is one framework that has successfully employed generative pre-trained language models in aspect-based sentiment analysis, particularly in English. LEGO-ABSA uses a multitask learning and prompting approach to enhance model performance. However, the application of this approach has not been done in the context of Bahasa Indonesia. Therefore, this research aims to implement the multitask learning and prompting approach in aspect-based sentiment analysis for Bahasa Indonesia using generative pre-trained language models. In this study, the Indo LEGO-ABSA model is developed, which is an aspect-based sentiment analysis model utilizing generative pre-trained language models and trained with multitask learning and prompting. Indo LEGO-ABSA is trained with a hotel domain dataset in the Indonesian language. The obtained results include an f1-score of 79.55% for the Aspect Sentiment Triplet Extraction task, 86.09% for Unified Aspect-based Sentiment Analysis, 79.85% for Aspect Opinion Pair Extraction, 87.45% for Aspect Term Extraction, and 88.09% for Opinion Term Extraction. Indo LEGO-ABSA adopts the LEGO-ABSA framework that employs the T5 model, specifically mT5, by applying multitask learning to train all tasks within aspect-based sentiment analysis.


Generalizable Natural Language Processing Framework for Migraine Reporting from Social Media

Guo, Yuting, Rajwal, Swati, Lakamana, Sahithi, Chiang, Chia-Chun, Menell, Paul C., Shahid, Adnan H., Chen, Yi-Chieh, Chhabra, Nikita, Chao, Wan-Ju, Chao, Chieh-Ju, Schwedt, Todd J., Banerjee, Imon, Sarker, Abeed

arXiv.org Artificial Intelligence

Migraine is a high-prevalence and disabling neurological disorder. However, information migraine management in real-world settings could be limited to traditional health information sources. In this paper, we (i) verify that there is substantial migraine-related chatter available on social media (Twitter and Reddit), self-reported by migraine sufferers; (ii) develop a platform-independent text classification system for automatically detecting self-reported migraine-related posts, and (iii) conduct analyses of the self-reported posts to assess the utility of social media for studying this problem. We manually annotated 5750 Twitter posts and 302 Reddit posts. Our system achieved an F1 score of 0.90 on Twitter and 0.93 on Reddit. Analysis of information posted by our 'migraine cohort' revealed the presence of a plethora of relevant information about migraine therapies and patient sentiments associated with them. Our study forms the foundation for conducting an in-depth analysis of migraine-related information using social media data.


Working with Text -Part 4. Techniques in handling text data

#artificialintelligence

Example: 'I want to read a book' In the above example there are 6 tokens which are- ('I', 'want, 'to', 'read', 'a' and'book') A type is the class of all tokens containing the same character sequence. In the above example, there are only 5 types which are - 'can, 'you', 'a, 'as' and'canner' as'can', 'as' and'a' are being repeated. In the above example, by deleting period and hyphens between the characters and words we are normalizing the type by making it a term. So the term in the above example is: 'USA' and'antiinflammatory' Example: "Hello everyone.Welcome to the course." The tokens for the given sentence will be -- ['Hello','everyone', 'Welcome', 'to', 'the', 'course'] Welcome to the Natural Language Processing course.


a-guide-to-rasa-and-rasa-x

#artificialintelligence

I hope you read and enjoyed my previous blog titled'Introduction to Rasa X' since it is a precursor to this one. In case you haven't, you can read it here. In this blog, I am going to lead you through the installation, folder structure, controls, and features of Rasa as well as Rasa X to develop an assistant. Let's first dive into installing Rasa. To install Rasa, you require Python 3.7 or Python 3.8.


Understanding Tree Models

#artificialintelligence

Originally published on Towards AI the World's Leading AI and Technology News and Media Company. If you are building an AI-related product or service, we invite you to consider becoming an AI sponsor. At Towards AI, we help scale AI and technology startups. Let us help you unleash your technology to the masses. Life is full of decisions and eventually, we do measure which option to take on some logical-based analysis.


Preliminaries To Machine Learning

#artificialintelligence

A lot of us want to dive deep into the ocean of Machine Learning. We discover our routes in the ocean by utilizing resources like YouTube, Google, and good old books, but most of us fail to understand the underlying concepts and basics that every beginner should be aware of. This article aims to cover these preliminaries that should be the first step in your journey of becoming a Samurai in Machine Learning. Learning itself is a very complex concept to define precisely. Psychologists and Zoologists have been struggling to understand how animals and humans learn.


Your Complete Guide To Image Segmentation

#artificialintelligence

Computer vision has advanced rapidly over the last few years. At its core, computer vision is the technology that allows machines to process their surroundings as humans do. While the human brain is naturally capable of multi-tasking and making quick decisions, transferring this capability to machines was a challenge in the beginning. However, today we have been able to build computer vision models that can detect objects, determine shapes, predict object movements, and take necessary actions based on data. Self-driving cars, ariel mapping, surveillance applications, and various other AR/VR technologies we enjoy today are a result of the progress made in computer vision models.


Understand Weight of Evidence and Information Value! - Analytics Vidhya

#artificialintelligence

We have all built a logistic regression at some point in our lives. Even if we have never built a model, we have definitely learned this predictive model technique theoretically. Two simple, undervalued concepts used in the preprocessing step to build a logistic regression model are the weight of evidence and information value. I would like to bring them back to the limelight through this article. First thing first, we all know logistic regression is a classification problem.